Method and apparatus for determining a context model for transform coefficient level entropy encoding and decoding

ABSTRACT

Provided are a method and apparatus for determining a context model for entropy encoding and decoding of a transformation coefficient. According to the method and apparatus, a context set index ctxset is obtained based on color component information of a transformation unit, a location of a current subset, and whether there is a significant transformation coefficient having a value greater than a first critical value in a previous subset, and a context offset c 1  is obtained based on a length of a previous transformation coefficient having consecutive 1s. Also, a context index ctxids for entropy encoding and decoding of a first critical value flag is determined based on the obtained context set index and the context offset.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of U.S. application Ser. No.14/355,439 filed Apr. 30, 2014, which is a National Stage Applicationunder 35 U.S.C. §371 of International Application No. PCT/KR2012/009074filed on Oct. 31, 2012, and claims the benefit of U.S. ProvisionalApplication No. 61/553,668, filed on Oct. 31, 2011 and U.S. ProvisionalApplication No. 61/671,955 filed on Jul. 16, 2012 in the United StatesPatent and Trademark Office, the disclosures of which are incorporatedherein by reference in their entireties.

TECHNICAL FIELD

The present invention relates to encoding and decoding of video, andmore particularly, to a method and apparatus for selecting a contextmodel which is used in entropy encoding and decoding of size informationof a transformation coefficient.

BACKGROUND ART

According to image compression methods such as MPEG-1, MPEG-2, or MPEG-4H.264/MPEG-4 advanced video coding (AVC), an image is split into blockshaving a predetermined size, and then, residual data of the blocks isobtained by inter prediction or intra prediction. Residual data iscompressed by transformation, quantization, scanning, run length coding,and entropy coding. In entropy coding, a syntax element such as atransformation coefficient or a motion vector is entropy encoded tooutput a bit stream. At a decoder's end, a syntax element is extractedfrom a bit stream, and decoding is performed based on the extractedsyntax element.

DISCLOSURE Technical Problem

The technical problem to be solved by the present invention are removingunnecessary context models used in the entropy encoding and decoding ofa transformation coefficient level and simplifying to reduce memory costto store the context model.

The present invention also provides increasing the speed of an operationof selecting a context model and simplifying the operation withoutgreatly decreasing the entropy encoding and decoding performance.

Technical Solution

The present invention provides a method and apparatus for selecting acontext model which is used in entropy encoding and decoding of atransformation coefficient level, in which a context model used inentropy encoding and decoding of a transformation coefficient level isselected based on color information, a location of a subset including atransformation coefficient, and information of a continuous length of 1,or the like.

Advantageous Effects

According to the embodiments of the present invention, by using areduced number of context models, the use amount of a memory for storingcontext models may be reduced, and entropy encoding and decoding of thetransformation coefficient level may be performed without a greatdecrease in performance.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram of an apparatus for encoding a video,according to an embodiment of the present invention;

FIG. 2 is a block diagram of an apparatus for decoding a video,according to an embodiment of the present invention;

FIG. 3 is a diagram for describing a concept of coding units accordingto an embodiment of the present invention;

FIG. 4 is a block diagram of a video encoder based on coding unitshaving a hierarchical structure, according to an embodiment of thepresent invention;

FIG. 5 is a block diagram of a video decoder based on coding unitshaving a hierarchical structure, according to an embodiment of thepresent invention;

FIG. 6 is a diagram illustrating deeper coding units according todepths, and partitions, according to an embodiment of the presentinvention;

FIG. 7 is a diagram for describing a relationship between a coding unitand transformation units, according to an embodiment of the presentinvention;

FIG. 8 is a diagram for describing encoding information of coding unitscorresponding to a coded depth, according to an embodiment of thepresent invention;

FIG. 9 is a diagram of deeper coding units according to depths,according to an embodiment of the present invention;

FIGS. 10 through 12 are diagrams for describing a relationship betweencoding units, prediction units, and frequency transformation units,according to an embodiment of the present invention;

FIG. 13 is a diagram for describing a relationship between a codingunit, a prediction unit, and a transformation unit, according toencoding mode information of Table 1;

FIG. 14 is a flowchart illustrating an operation of entropy encoding anddecoding of transformation coefficient information included in atransformation unit, according to an embodiment of the presentinvention;

FIG. 15 illustrates subsets obtained by splitting a transformation unitaccording to an embodiment of the present invention;

FIG. 16 illustrates a subset included in the transformation unit of FIG.15, according to an embodiment of the present invention;

FIG. 17 illustrates a significant map corresponding to the subset ofFIG. 16;

FIG. 18 illustrates a first critical value flag corresponding to thesubset of FIG. 16;

FIG. 19 illustrates a second critical value flag corresponding to thesubset of FIG. 16;

FIG. 20 is a table showing transformation coefficients included in asubset illustrated in FIGS. 16 through 19 and transformation coefficientinformation that is entropy encoded and decoded;

FIG. 21A is a structural block diagram illustrating an entropy encodingapparatus according to an embodiment of the present invention;

FIG. 21B is a structural block diagram illustrating an entropy decodingapparatus according to an embodiment of the present invention;

FIG. 22 is a structural block diagram illustrating a context modeleraccording to an embodiment of the present invention;

FIG. 23 illustrates a plurality of context sets applied to atransformation unit of a luminance component and a plurality of contextsincluded in each context set, according to an embodiment of the presentinvention;

FIG. 24 illustrates a plurality of context sets applied to atransformation unit of a chrominance component and a plurality ofcontexts included in each context set, according to an embodiment of thepresent invention;

FIG. 25 is a flowchart illustrating a method of determining a contextmodel for entropy encoding and decoding of a transformation coefficientlevel, according to an embodiment of the present invention;

FIG. 26 is a detailed flowchart illustrating a method of determining acontext model for entropy encoding and decoding of a transformationcoefficient level, according to an embodiment of the present invention;

FIG. 27A illustrates a context set index ctxset for determining acontext set used in entropy encoding and decoding of a first criticalvalue flag Gtr1 flag and a second critical value flag Gtr2 flag of asignificant transformation coefficient of a luminance component and asignificant transformation coefficient of a chrominance componentaccording to an embodiment of the present invention;

FIG. 27B illustrates a context offset used in entropy encoding anddecoding of a first critical value flag Gtr1 flag and a second criticalvalue flag Ctr2 flag, according to an embodiment of the presentinvention;

FIG. 28 illustrates a table showing a context offset index c1 that isused in entropy encoding or decoding of the transformation coefficientsincluded in a subset and the transformation coefficient information thatis entropy encoded or decoded, of FIG. 20, according to an embodiment ofthe present invention; and

FIG. 29 illustrates a table showing a context offset index c1 used inentropy encoding and decoding of transformation coefficients included ina subset and transformation coefficient information that is entropyencoded or decoded, according to another embodiment of the presentinvention.

BEST MODE

According to an aspect of the present invention, there is provided amethod of determining a context model for entropy encoding and decodingof a transformation coefficient level, the method comprising: splittinga transformation unit into subsets having a predetermined size andobtaining a significant transformation coefficient that is included ineach of the subsets and is not 0; obtaining a context set index fordetermining a context set used in entropy encoding and decoding of afirst critical value flag indicating whether the significanttransformation coefficient from among a plurality of context setsincluding a plurality of contexts has a value greater than a firstcritical value, based on color component information of a transformationunit, location information of a first subset in which the significanttransformation coefficient is included, and whether there is asignificant transformation coefficient having a value greater than thefirst critical value in a second subset that is processed before thefirst subset; obtaining a context offset for determining one of aplurality of contexts included in a context set used in entropy encodingand decoding of the first critical value flag based on a length of aprevious transformation coefficient having consecutive 1s; and obtaininga context index indicating a context used in entropy encoding ordecoding of the first critical value flag by using the context set indexand the context offset.

According to another aspect of the present invention, there is providedan apparatus for determining a context model for entropy encoding anddecoding of a transformation coefficient level, comprising: a mappingunit that splits a transformation unit into subsets having apredetermined size and obtains a significant transformation coefficientthat is included in each of the subsets and is not 0; a context setobtaining unit obtaining a context set index for determining a contextset used in entropy encoding and decoding of a first critical value flagindicating whether the significant transformation coefficient from amonga plurality of context sets including a plurality of contexts has avalue greater than a first critical value, based on color componentinformation of a transformation unit, location information of a firstsubset in which the significant transformation coefficient is included,and whether there is a significant transformation coefficient having avalue greater than the first critical value in a second subset that isprocessed before the first subset; a context offset obtaining unitobtaining a context offset for determining one of a plurality ofcontexts included in a context set used in entropy encoding and decodingof the first critical value flag based on a length of a previoustransformation coefficient having consecutive 1s; and a contextdetermining unit obtaining a context index indicating a context used inentropy encoding or decoding of the first critical value flag by usingthe context set index and the context offset.

MODE FOR INVENTION

Hereinafter, an “image” described in various embodiments of the presentapplication may be an inclusive concept referring to not only a stillimage but a video image.

When various operations are performed on data related to an image, thedata related to the image is split into data groups, and the sameoperation may be performed on data included in the same data group. Inthis specification, a data group formed according to predeterminedstandards is referred to as a ‘data unit’ Hereinafter, an operationperformed on each ‘data unit’ is understood as performed using dataincluded in a data unit.

Hereinafter, a method and apparatus for encoding and decoding of videoin which a syntax element having a tree structure is encoded or decodedbased on a coding unit having a hierarchical tree structure, accordingto an embodiment of the present invention, will be described withreference to FIGS. 1 through 13. In addition, the method of entropyencoding and decoding used in the encoding and decoding of videodescribed with reference to FIGS. 1 through 13 will be described indetail with reference to FIGS. 14 through 29.

FIG. 1 is a block diagram of a video encoding apparatus 100 according toan embodiment of the present invention.

The video encoding apparatus 100 includes a hierarchical encoder 110 andan entropy encoder 120.

The hierarchical encoder 110 may split a current picture to be encoded,in units of predetermined data units to perform encoding on each of thedata units. In detail, the hierarchical encoder 110 may split a currentpicture based on a maximum coding unit, which is a coding unit of amaximum size. The maximum coding unit according to an embodiment of thepresent invention may be a data unit having a size of 32×32, 64×64,128×128, 256×256, etc., wherein a shape of the data unit is a squarewhich has a width and length in squares of 2 and is greater than 8.

A coding unit according to an embodiment of the present invention may becharacterized by a maximum size and a depth. The depth denotes thenumber of times the coding unit is spatially split from the maximumcoding unit, and as the depth deepens, deeper encoding units accordingto depths may be split from the maximum coding unit to a minimum codingunit. A depth of the maximum coding unit is an uppermost depth and adepth of the minimum coding unit is a lowermost depth. Since a size of acoding unit corresponding to each depth decreases as the depth of themaximum coding unit deepens, a coding unit corresponding to an upperdepth may include a plurality of coding units corresponding to lowerdepths.

As described above, image data of the current picture is split into themaximum coding units according to a maximum size of the coding unit, andeach of the maximum coding units may include deeper coding units thatare split according to depths. Since the maximum coding unit accordingto an embodiment of the present invention is split according to depths,the image data of a spatial domain included in the maximum coding unitmay be hierarchically classified according to depths.

A maximum depth and a maximum size of a coding unit, which limit thetotal number of times a height and a width of the maximum coding unitare hierarchically split, may be predetermined.

The hierarchical encoder 110 encodes at least one split region obtainedby splitting a region of the maximum coding unit according to depths,and determines a depth to output finally encoded image data according tothe at least one split region. In other words, the hierarchical encoder110 determines a coded depth by encoding the image data in the deepercoding units according to depths, according to the maximum coding unitof the current picture, and selecting a depth having the least encodingerror. The determined coded depth and the encoded image data accordingto maximum encoding units are output to the entropy encoder 120.

The image data in the maximum coding unit is encoded based on the deepercoding units corresponding to at least one depth equal to or smallerthan the maximum depth, and results of encoding the image data arecompared based on each of the deeper coding units. A depth having theleast encoding error may be selected after comparing encoding errors ofthe deeper coding units. At least one coded depth may be selected foreach maximum coding unit.

The size of the maximum coding unit is split as a coding unit ishierarchically split according to depths and as the number of codingunits increases. Also, even if coding units correspond to a same depthin one maximum coding unit, it is determined whether to split each ofthe coding units corresponding to the same depth to a lower depth bymeasuring an encoding error of the image data of each coding unit,separately. Accordingly, even when image data is included in one maximumcoding unit, the image data is split into regions according to thedepths, and the encoding errors may differ according to regions in theone maximum coding unit, and thus the coded depths may differ accordingto regions in the image data. Thus, one or more coded depths may bedetermined in one maximum coding unit, and the image data of the maximumcoding unit may be divided according to coding units of at least onecoded depth.

Accordingly, the hierarchical encoder 110 may determine coding unitshaving a tree structure included in the maximum coding unit. The ‘codingunits having a tree structure’ according to an embodiment of the presentinvention include coding units corresponding to a depth determined to bethe coded depth, from among all deeper coding units included in themaximum coding unit. A coding unit having a coded depth may behierarchically determined according to depths in the same region of themaximum coding unit, and may be independently determined in differentregions. Similarly, a coded depth in a current region may beindependently determined from a coded depth in another region.

A maximum depth according to an embodiment of the present invention isan index related to the number of times splitting is performed from amaximum coding unit to a minimum coding unit. A first maximum depthaccording to an embodiment of the present invention may denote the totalnumber of times splitting is performed from the maximum coding unit tothe minimum coding unit. A second maximum depth according to anembodiment of the present invention may denote the total number of depthlevels from the maximum coding unit to the minimum coding unit. Forexample, when a depth of the maximum coding unit is 0, a depth of acoding unit, in which the maximum coding unit is split once, may be setto 1, and a depth of a coding unit, in which the maximum coding unit issplit twice, may be set to 2. Here, if the minimum coding unit is acoding unit in which the maximum coding unit is split four times, fivedepth levels of depths 0, 1, 2, 3, and 4 exist, and thus the firstmaximum depth may be set to 4, and the second maximum depth may be setto 5.

Prediction encoding and transformation may be performed according to themaximum coding unit. The prediction encoding and the transformation arealso performed based on the deeper coding units according to a depthequal to or depths less than the maximum depth, according to the maximumcoding unit.

Since the number of deeper coding units increases whenever the maximumcoding unit is split according to depths, encoding including theprediction encoding and the transformation is performed on all of thedeeper coding units generated as the depth deepens. For convenience ofdescription, the prediction encoding and the transformation will now bedescribed based on a coding unit of a current depth, in a maximum codingunit.

The video encoding apparatus 100 may variously select a size or shape ofa data unit for encoding the image data. In order to encode the imagedata, operations, such as prediction encoding, transformation, andentropy encoding, are performed, and at this time, the same data unitmay be used for all operations or different data units may be used foreach operation.

For example, the video encoding apparatus 100 may select not only acoding unit for encoding the image data, but also a data unit differentfrom the coding unit so as to perform the prediction encoding on theimage data in the coding unit.

In order to perform prediction encoding in the maximum coding unit, theprediction encoding may be performed based on a coding unitcorresponding to a coded depth, i.e., based on a coding unit that is nolonger split into coding units corresponding to a lower depth.Hereinafter, the coding unit that is no longer split and becomes a basisunit for prediction encoding will now be referred to as a ‘predictionunit’. A partition obtained by splitting the prediction unit may includea prediction unit or a data unit obtained by splitting at least one of aheight and a width of the prediction unit.

For example, when a coding unit of 2N×2N (where N is a positive integer)is no longer split and becomes a prediction unit of 2N×2N, a size of apartition may be 2N×2N, 2N×N, N×2N, or N×N. Examples of a partition typeinclude symmetrical partitions that are obtained by symmetricallysplitting a height or width of the prediction unit, partitions obtainedby asymmetrically splitting the height or width of the prediction unit,such as 1:n or n:1, partitions that are obtained by geometricallysplitting the prediction unit, and partitions having arbitrary shapes.

A prediction mode of the prediction unit may be at least one of an intramode, an inter mode, and a skip mode. For example, the intra mode or theinter mode may be performed on the partition of 2N×2N, 2N×N, N×2N, orN×N. Also, the skip mode may be performed only on the partition of2N×2N. The encoding is independently performed on one prediction unit ina coding unit, thereby selecting a prediction mode having the leastencoding error.

The video encoding apparatus 100 may also perform the transformation onthe image data in a coding unit based not only on the coding unit forencoding the image data, but also based on a data unit that is differentfrom the coding unit.

In order to perform the transformation in the coding unit, thetransformation may be performed based on a data unit having a sizesmaller than or equal to the coding unit. For example, the data unit forthe transformation may include a data unit for an intra mode and a dataunit for an inter mode.

A data unit used as a base of the transformation will now be referred toas a ‘transformation unit’. Similarly to the coding unit, thetransformation unit in the coding unit may be recursively split intosmaller sized regions, so that the transformation unit may be determinedindependently in units of regions. Thus, residual data in the codingunit may be divided according to the transformation unit having the treestructure according to transformation depths.

A transformation depth indicating the number of times splitting isperformed to reach the transformation unit by splitting the height andwidth of the coding unit may also be set in the transformation unit. Forexample, in a current coding unit of 2N×2N, a transformation depth maybe 0 when the size of a transformation unit is 2N×2N, may be 1 when thesize of a transformation unit is N×N, and may be 2 when the size of atransformation unit is N/2×N/2. That is, the transformation unit havingthe tree structure may also be set according to transformation depths.

Encoding information according to coding units corresponding to a codeddepth requires not only information about the coded depth, but alsoabout information related to prediction encoding and transformation.Accordingly, the hierarchical encoder 110 not only determines a codeddepth having the least encoding error, but also determines a partitiontype in a prediction unit, a prediction mode according to predictionunits, and a size of a transformation unit for transformation.

Coding units according to a tree structure in a maximum coding unit anda method of determining a partition, according to embodiments of thepresent invention, will be described in detail later with reference toFIGS. 3 through 12.

The hierarchical encoder 110 may measure an encoding error of deepercoding units according to depths by using Rate-Distortion Optimizationbased on Lagrangian multipliers.

The entropy encoder 120 outputs the image data of the maximum codingunit, which is encoded based on the at least one coded depth determinedby the hierarchical encoder 110, and information about the encoding modeaccording to the coded depth, in bit streams. The encoded image data maybe a coding result of residual data of an image. The information aboutthe encoding mode according to the coded depth may include informationabout the coded depth, information about the partition type in theprediction unit, prediction mode information, and size information ofthe transformation unit. In particular, as will be described later, theentropy encoder 120 may obtain a context set index indicating one of aplurality of context sets based on whether a significant transformationcoefficient having a greater value than a first critical value exists incolor component information of a transformation unit, a location of acurrent subset, and a previous subset, so as to obtain a context offsetbased on a length of a previous transformation coefficient havingconsecutive 1s. In addition, the entropy encoder 120 determines acontext index ctdldx indicating a context model that is to be applied toa first critical value flag Greaterthan1 flag indicating whether thesignificant transformation coefficient is greater than a first criticalvalue, that is, 1, and a second critical value flag Greaterthan2 flagindicating whether the significant transformation coefficient is greaterthan a second critical value, that is, 2, based on the obtained contextset index and the obtained context offset. The operation of determininga context model for entropy encoding of transformation coefficients tobe performed by the entropy encoder 120 will be described later.

The information about the coded depth may be defined by using splitinformation according to depths, which indicates whether encoding isperformed on coding units of a lower depth instead of a current depth.If the current depth of the current coding unit is the coded depth,image data in the current coding unit is encoded and output, and thusthe split information may be defined not to split the current codingunit to a lower depth. Alternatively, if the current depth of thecurrent coding unit is not the coded depth, the encoding is performed onthe coding unit of the lower depth, and thus the split information maybe defined to split the current coding unit to obtain the coding unitsof the lower depth.

If the current depth is not the coded depth, encoding is performed onthe coding unit that is split into the coding unit of the lower depth.Since at least one coding unit of the lower depth exists in one codingunit of the current depth, the encoding is repeatedly performed on eachcoding unit of the lower depth, and thus the encoding may be recursivelyperformed for the coding units having the same depth.

Since the coding units having a tree structure are determined for onemaximum coding unit, and information about at least one encoding mode isdetermined for a coding unit of a coded depth, information about atleast one encoding mode may be determined for one maximum coding unit.Also, a coded depth of the image data of the maximum coding unit may bedifferent according to locations since the image data is hierarchicallysplit according to depths, and thus information about the coded depthand the encoding mode may be set for the image data.

Accordingly, the entropy encoder 120 may assign encoding informationabout a corresponding coded depth and an encoding mode to at least oneof the coding unit, the prediction unit, and a minimum unit included inthe maximum coding unit.

The minimum unit according to an embodiment of the present invention isa square-shaped data unit obtained by splitting the minimum coding unitconstituting the lowermost depth by 4. Alternatively, the minimum unitmay be a maximum square-shaped data unit that may be included in all ofthe coding units, prediction units, partition units, and transformationunits included in the maximum coding unit.

For example, the encoding information output through the entropy encoder120 may be classified into encoding information according to deepercoding units according to depths and encoding information according toprediction units. The encoding information according to the deepercoding units according to depths may include the information about theprediction mode and about the size of the partitions. The encodinginformation according to the prediction units may include informationabout an estimated direction of an inter mode, about a reference imageindex of the inter mode, about a motion vector, about a chroma componentof an intra mode, and about an interpolation method of the intra mode.Also, information about a maximum size of the coding unit definedaccording to pictures, slices, or GOPs, and information about a maximumdepth may be inserted into a header of a bit stream.

In the video encoding apparatus 100, the deeper coding unit may be acoding unit obtained by dividing a height or width of a coding unit ofan upper depth, which is one layer above, by two. In other words, whenthe size of the coding unit of the current depth is 2N×2N, the size ofthe coding unit of the lower depth is N×N. Also, the coding unit of thecurrent depth having the size of 2N×2N may include a maximum number offour coding units of the lower depth.

Accordingly, the video encoding apparatus 100 may form the coding unitshaving the tree structure by determining coding units having an optimumshape and an optimum size for each maximum coding unit, based on thesize of the maximum coding unit and the maximum depth determinedconsidering characteristics of the current picture. Also, since encodingmay be performed on each maximum coding unit by using any one of variousprediction modes and transformations, an optimum encoding mode may bedetermined considering characteristics of the coding unit of variousimage sizes.

Thus, if an image having a high resolution or a large data amount isencoded in a conventional macroblock, a number of macroblocks perpicture excessively increases. Accordingly, a number of pieces ofcompressed information generated for each macroblock increases, and thusit is difficult to transmit the compressed information and datacompression efficiency decreases. Therefore, by using the video encodingapparatus 100, image compression efficiency may be increased since acoding unit is adjusted while considering characteristics of an imagewhile increasing a maximum size of a coding unit while considering asize of the image.

FIG. 2 is a block diagram of a video decoding apparatus 200 according toan embodiment of the present invention.

The video decoding apparatus 200 includes a syntax element extractingunit 210, an entropy decoder 220, and a hierarchical decoder 230.Definitions of various terms, such as a coding unit, a depth, aprediction unit, a transformation unit, and information about variousencoding modes, for various operations of the video decoding apparatus200 are identical to those described with reference to FIG. 1 and thevideo encoding apparatus 100.

The syntax element extracting unit 210 receives and parses a bitstreamof an encoded video. The entropy decoder 220 extracts encoded image datafor each coding unit from the parsed bitstream, wherein the coding unitshave a tree structure according to each maximum coding unit, and outputsthe extracted image data to the hierarchical decoder 230.

Also, the entropy decoder 220 extracts information about a coded depth,an encoding mode, color component information, prediction modeinformation, etc. for the coding units having a tree structure accordingto each maximum coding unit, from the parsed bitstream. The extractedinformation about the coded depth and the encoding mode is output to thehierarchical decoder 230. The image data in a bit stream is split intothe maximum coding unit so that the hierarchical decoder 230 may decodethe image data for each maximum coding unit.

The information about the coded depth and the encoding mode according tothe maximum coding unit may be set for information about at least onecoding unit corresponding to the coded depth, and information about anencoding mode may include information about a partition type of acorresponding coding unit corresponding to the coded depth, about aprediction mode, and a size of a transformation unit. Also, splittinginformation according to depths may be extracted as the informationabout the coded depth.

The information about the coded depth and the encoding mode according toeach maximum coding unit extracted by the entropy decoder 220 isinformation about a coded depth and an encoding mode determined togenerate a minimum encoding error when an encoder, such as the videoencoding apparatus 100, repeatedly performs encoding for each deepercoding unit according to depths according to each maximum coding unit.Accordingly, the video decoding apparatus 200 may restore an image bydecoding the image data according to a coded depth and an encoding modethat generates the minimum encoding error.

Since encoding information about the coded depth and the encoding modemay be assigned to a predetermined data unit from among a correspondingcoding unit, a prediction unit, and a minimum unit, the entropy decoder220 may extract the information about the coded depth and the encodingmode according to the predetermined data units. The predetermined dataunits to which the same information about the coded depth and theencoding mode is assigned may be inferred to be the data units includedin the same maximum coding unit.

Also, as will be described later, the entropy decoder 220 may obtain acontext set index indicating one of a plurality of context sets based onwhether a significant transformation coefficient having a greater valuethan a first critical value exists in color component information of atransformation unit, a location of a current subset, and a previoussubset, so as to obtain a context offset based on a length of a previoustransformation coefficient having consecutive 1s. In addition, theentropy decoder 220 determines a context index ctdldx indicating acontext model that is to be applied to a first critical value flagGreaterthan1 flag indicating whether the significant transformationcoefficient is greater than a first critical value, that is, 1, and asecond critical value flag Greaterthan2 flag indicating whether thesignificant transformation coefficient is greater than a second criticalvalue, that is, 2, based on the obtained context set index and theobtained context offset.

The hierarchical decoder 230 restores the current picture by decodingthe image data in each maximum coding unit based on the informationabout the coded depth and the encoding mode according to the maximumcoding units. In other words, the hierarchical decoder 230 may decodethe encoded image data based on the extracted information about thepartition type, the prediction mode, and the transformation unit foreach coding unit from among the coding units having the tree structureincluded in each maximum coding unit. A decoding process may includeprediction including intra prediction and motion compensation, andinverse transformation.

The hierarchical decoder 230 may perform intra prediction or motioncompensation according to a partition and a prediction mode of eachcoding unit, based on the information about the partition type and theprediction mode of the prediction unit of the coding unit according tocoded depths.

Also, the hierarchical decoder 230 may perform inverse transformationaccording to each transformation unit in the coding unit, based on theinformation about the size of the transformation unit of the coding unitaccording to coded depths, so as to perform the inverse transformationaccording to maximum coding units.

The hierarchical decoder 230 may determine at least one coded depth of acurrent maximum coding unit by using split information according todepths. If the split information indicates that image data is no longersplit in the current depth, the current depth is a coded depth.Accordingly, the hierarchical decoder 230 may decode the coding unit ofthe current depth with respect to the image data of the current maximumcoding unit by using the information about the partition type of theprediction unit, the prediction mode, and the size of the transformationunit.

In other words, data units containing the encoding information includingthe same split information may be gathered by observing the encodinginformation set assigned for the predetermined data unit from among thecoding unit, the prediction unit, and the minimum unit, and the gathereddata units may be considered to be one data unit to be decoded by thehierarchical decoder 230 in the same encoding mode.

The video decoding apparatus 200 may obtain information about at leastone coding unit that generates the minimum encoding error when encodingis recursively performed for each maximum coding unit, and may use theinformation to decode the current picture. In other words, encoded imagedata of the coding units having the tree structure determined to be theoptimum coding units in each maximum coding unit may be decoded.

Accordingly, even if image data has a high resolution and a large amountof data, the image data may be efficiently decoded and restored by usinga size of a coding unit and an encoding mode, which are adaptivelydetermined according to characteristics of the image data, by usinginformation about an optimum encoding mode received from an encoder.

A method of determining coding units having a tree structure, aprediction unit, and a transformation unit, according to an embodimentof the present invention, will now be described with reference to FIGS.3 through 13.

FIG. 3 is a diagram for describing a concept of coding units accordingto an embodiment of the present invention.

A size of a coding unit may be expressed in width×height, and may be64×64, 32×32, 16×16, and 8×8. A coding unit of 64×64 may be split intopartitions of 64×64, 64×32, 32×64, or 32×32; and a coding unit of 32×32may be split into partitions of 32×32, 32×16, 16×32, or 16×16; a codingunit of 16×16 may be split into partitions of 16×16, 16×8, 8×16, or 8×8;and a coding unit of 8×8 may be split into partitions of 8×8, 8×4, 4×8,or 4×4.

In video data 310, a resolution is 1920×1080, a maximum size of a codingunit is 64, and a maximum depth is 2. In video data 320, a resolution is1920×1080, a maximum size of a coding unit is 64, and a maximum depth is3. In video data 330, a resolution is 352×288, a maximum size of acoding unit is 16, and a maximum depth is 1. The maximum depth shown inFIG. 3 denotes a total number of splits from a maximum coding unit to aminimum coding unit.

If a resolution is high or a data amount is large, a maximum size of acoding unit may be large so as to not only increase encoding efficiencybut also to accurately reflect characteristics of an image. Accordingly,the maximum size of the coding unit of the video data 310 and 320 havingthe higher resolution than the video data 330 may be 64.

Since the maximum depth of the video data 310 is 2, coding units 315 ofthe vide data 310 may include a maximum coding unit having a long axissize of 64, and coding units having long axis sizes of 32 and 16 sincedepths are deepened to two layers by splitting the maximum coding unittwice. Meanwhile, since the maximum depth of the video data 330 is 1,coding units 335 of the video data 330 may include a maximum coding unithaving a long axis size of 16, and coding units having a long axis sizeof 8 since depths are deepened to one layer by splitting the maximumcoding unit once.

Since the maximum depth of the video data 320 is 3, coding units 325 ofthe video data 320 may include a maximum coding unit having a long axissize of 64, and coding units having long axis sizes of 32, 16, and 8since the depths are deepened to 3 layers by splitting the maximumcoding unit three times. As a depth deepens, detailed information may beprecisely expressed.

FIG. 4 is a block diagram of a video encoder 400 based on coding unitshaving a hierarchical structure, according to an embodiment of thepresent invention.

An intra predictor 410 performs intra prediction on coding units in anintra mode, with respect to a current frame 405, and a motion estimator420 and a motion compensator 425 respectively perform inter estimationand motion compensation on coding units in an inter mode by using thecurrent frame 405 and a reference frame 495.

Data output from the intra predictor 410, the motion estimator 420, andthe motion compensator 425 is output as a quantized transformationcoefficient through a transformer 430 and a quantizer 440. The quantizedtransformation coefficient is restored as data in a spatial domainthrough an inverse quantizer 460 and an inverse transformer 470, and therestored data in the spatial domain is output as the reference frame 495after being post-processed through a deblocking unit 480 and a loopfiltering unit 490. The quantized transformation coefficient may beoutput as a bitstream 455 through an entropy encoder 450.

When encoding syntax elements of a transformation unit such as a firstcritical value flag Gtr1 flag or a second critical value flag Gtr2 flag,the entropy encoder 450 obtains a context set index based on whetherthere is a significant transformation coefficient having a greater valuethan a first critical value in color component information of atransformation unit, a location of a current subset, and a previoussubset, obtains a context offset based on a length of a previoustransformation coefficient having consecutive 1s, and determines acontext index indicating a context model based on the obtained contextset index and the obtained context offset.

In order for the video encoder 400 to be applied in the video encodingapparatus 100, all elements of the video encoder 400, i.e., the intrapredictor 410, the motion estimator 420, the motion compensator 425, thetransformer 430, the quantizer 440, the entropy encoder 450, the inversequantizer 460, the inverse transformer 470, the deblocking unit 480, andthe loop filtering unit 490, perform operations based on each codingunit from among coding units having a tree structure while consideringthe maximum depth of each maximum coding unit.

Specifically, the intra predictor 410, the motion estimator 420, and themotion compensator 425 determine partitions and a prediction mode ofeach coding unit from among the coding units having a tree structurewhile considering the maximum size and the maximum depth of a currentmaximum coding unit, and the transformer 430 determines the size of thetransformation unit in each coding unit from among the coding unitshaving a tree structure.

FIG. 5 is a block diagram of a video decoder 500 based on coding units,according to an embodiment of the present invention.

A parser 510 parses encoded image data to be decoded and informationabout encoding required for decoding, from a bitstream 505. The encodedimage data is output as inverse quantized data through an entropydecoder 520 and an inverse quantizer 530, and the inverse quantized datais restored to image data in a spatial domain through an inversetransformer 540.

An intra predictor 550 performs intra prediction on coding units in anintra mode with respect to the image data in the spatial domain, and amotion compensator 560 performs motion compensation on coding units inan inter mode by using a reference frame 585.

The image data in the spatial domain, which passed through the intrapredictor 550 and the motion compensator 560, may be output as arestored frame 595 after being post-processed through a deblocking unit570 and a loop filtering unit 580. Also, the image data, which ispost-processed through the deblocking unit 570 and the loop filteringunit 580, may be output as the reference frame 585.

In order for the video decoder 500 to be applied in the video decodingapparatus 200, all elements of the video decoder 500, i.e., the parser510, the entropy decoder 520, the inverse quantizer 530, the inversetransformer 540, the intra predictor 550, the motion compensator 560,the deblocking unit 570, and the loop filtering unit 580, performoperations based on coding units having a tree structure for eachmaximum coding unit.

The intra predictor 550 and the motion compensator 560 determine apartition and a prediction mode for each coding unit having a treestructure, and the inverse transformer 540 has to determine a size of atransformation unit for each coding unit. Also, when decoding syntaxelements of a transformation unit such as a first critical value flagGtr1 flag or a second critical value flag Gtr2 flag, the entropy decoder520 obtains a context set index based on whether there is a significanttransformation coefficient having a greater value than a first criticalvalue in color component information of a transformation unit, alocation of a current subset, and a previous subset, obtains a contextoffset based on a length of a previous transformation coefficient havingconsecutive 1s, and determines a context index indicating a contextmodel based on the obtained context set index and the obtained contextoffset.

FIG. 6 is a diagram illustrating deeper coding units according todepths, and partitions, according to an embodiment of the presentinvention.

The video encoding apparatus 100 and the video decoding apparatus 200use hierarchical coding units so as to consider characteristics of animage. A maximum height, a maximum width, and a maximum depth of codingunits may be adaptively determined according to the characteristics ofthe image, or may be differently set by a user. Sizes of deeper codingunits according to depths may be determined according to thepredetermined maximum size of the coding unit.

In a hierarchical structure 600 of coding units according to anembodiment of the present invention, the maximum height and the maximumwidth of the coding units are each 64, and the maximum depth is 4. Sincea depth deepens along a vertical axis of the hierarchical structure 600,a height and a width of the deeper coding unit are each split. Also, aprediction unit and partitions, which are bases for prediction encodingof each deeper coding unit, are shown along a horizontal axis of thehierarchical structure 600.

In other words, a coding unit 610 is a maximum coding unit in thehierarchical structure 600, wherein a depth is 0 and a size, i.e., aheight by width, is 64×64. The depth deepens along the vertical axis,and a coding unit 620 having a size of 32×32 and a depth of 1, a codingunit 630 having a size of 16×16 and a depth of 2, a coding unit 640having a size of 8×8 and a depth of 3, and a coding unit 650 having asize of 4×4 and a depth of 4 exist. The coding unit 650 having the sizeof 4×4 and the depth of 4 is a minimum coding unit.

The prediction unit and the partitions of a coding unit are arrangedalong the horizontal axis according to each depth. In other words, ifthe coding unit 610 having the size of 64×64 and the depth of 0 is aprediction unit, the prediction unit may be split into partitionsincluded in the encoding unit 610, i.e. a partition 610 having a size of64×64, partitions 612 having the size of 64×32, partitions 614 havingthe size of 32×64, or partitions 616 having the size of 32×32.

Similarly, a prediction unit of the coding unit 620 having the size of32×32 and the depth of 1 may be split into partitions included in thecoding unit 620, i.e. a partition 620 having a size of 32×32, partitions622 having a size of 32×16, partitions 624 having a size of 16×32, andpartitions 626 having a size of 16×16.

Similarly, a prediction unit of the coding unit 630 having the size of16×16 and the depth of 2 may be split into partitions included in thecoding unit 630, i.e. a partition having a size of 16×16 included in thecoding unit 630, partitions 632 having a size of 16×8, partitions 634having a size of 8×16, and partitions 636 having a size of 8×8.

Similarly, a prediction unit of the coding unit 640 having the size of8×8 and the depth of 3 may be split into partitions included in thecoding unit 640, i.e. a partition having a size of 8×8 included in thecoding unit 640, partitions 642 having a size of 8×4, partitions 644having a size of 4×8, and partitions 646 having a size of 4×4.

The coding unit 650 having the size of 4×4 and the depth of 4 is theminimum coding unit and a coding unit of the lowermost depth. Aprediction unit of the coding unit 650 is only assigned to a partitionhaving a size of 4×4.

In order to determine the at least one coded depth of the coding unitsconstituting the maximum coding unit 610, the encoding unit determiningunit (120) of the video encoding apparatus 100 performs encoding forcoding units corresponding to each depth included in the maximum codingunit 610.

The number of deeper coding units according to depths including data inthe same range and the same size increases as the depth deepens. Forexample, four coding units corresponding to a depth of 2 are required tocover data that is included in one coding unit corresponding to a depthof 1. Accordingly, in order to compare encoding results of the same dataaccording to depths, the coding unit corresponding to the depth of 1 andfour coding units corresponding to the depth of 2 are each encoded.

In order to perform encoding for a current depth from among the depths,a least encoding error may be selected for the current depth byperforming encoding for each prediction unit in the coding unitscorresponding to the current depth, along the horizontal axis of thehierarchical structure 600. Alternatively, the minimum encoding errormay be searched for by comparing the least encoding errors according todepths and performing encoding for each depth as the depth deepens alongthe vertical axis of the hierarchical structure 600. A depth and apartition having the minimum encoding error in the coding unit 610 maybe selected as the coded depth and a partition type of the coding unit610.

FIG. 7 is a diagram for describing a relationship between a coding unit710 and transformation units 720, according to an embodiment of thepresent invention.

The video encoding apparatus 100 or the video decoding apparatus 200encodes or decodes an image according to coding units having sizessmaller than or equal to a maximum coding unit for each maximum codingunit. Sizes of transformation units for transformation during encodingmay be selected based on data units that are not larger than acorresponding coding unit.

For example, in the video encoding apparatus 100 or the video decodingapparatus 200, if a size of the coding unit 710 is 64×64, transformationmay be performed by using the transformation units 720 having a size of32×32.

Also, data of the coding unit 710 having the size of 64×64 may beencoded by performing the transformation on each of the transformationunits having the size of 32×32, 16×16, 8×8, and 4×4, which are smallerthan 64×64, and then a transformation unit having the least coding errormay be selected.

FIG. 8 is a diagram for describing encoding information of coding unitscorresponding to a coded depth, according to an embodiment of thepresent invention.

An output unit 130 of the video encoding apparatus 100 may encode andtransmit information 800 about a partition type, information 810 about aprediction mode, and information 820 about a size of a transformationunit for each coding unit corresponding to a coded depth, as informationabout an encoding mode.

The information 800 indicates information about a shape of a partitionobtained by splitting a prediction unit of a current coding unit,wherein the partition is a data unit for prediction encoding the currentcoding unit. For example, a current coding unit CU_0 having a size of2N×2N may be split into any one of a partition 802 having a size of2N×2N, a partition 804 having a size of 2N×N, a partition 806 having asize of N×2N, and a partition 808 having a size of N×N. Here, theinformation 800 about a partition type is set to indicate one of thepartition 804 having a size of 2N×N, the partition 806 having a size ofN×2N, and the partition 808 having a size of N×N

The information 810 indicates a prediction mode of each partition. Forexample, the information 810 may indicate a mode of prediction encodingperformed on a partition indicated by the information 800, i.e., anintra mode 812, an inter mode 814, or a skip mode 816.

The information 820 indicates a transformation unit to be based on whentransformation is performed on a current coding unit. For example, thetransformation unit may be a first intra transformation unit 822, asecond intra transformation unit 824, a first inter transformation unit826, or a second inter transformation unit 828.

The entropy decoder 220 of the video decoding apparatus 200 may extractand use the information 800, 810, and 820 for decoding, according toeach deeper coding unit.

FIG. 9 is a diagram of deeper coding units according to depths,according to an embodiment of the present invention.

Split information may be used to indicate a change of a depth. The spiltinformation indicates whether a coding unit of a current depth is splitinto coding units of a lower depth.

A prediction unit 910 for prediction encoding of a coding unit 900having a depth of 0 and a size of 2N_(—)0×2N_(—)0 may include partitionsof a partition type 912 having a size of 2N_(—)0×2N_(—)0, a partitiontype 914 having a size of 2N_(—)0×N_(—)0, a partition type 916 having asize of N_(—)0×2N_(—)0, and a partition type 918 having a size ofN_(—)0×N_(—)0. FIG. 9 only illustrates the partition types 912 through918 which are obtained by symmetrically splitting the prediction unit910, but a partition type is not limited thereto, and the partitions ofthe prediction unit 910 may include asymmetrical partitions, partitionshaving a predetermined shape, and partitions having a geometrical shape.

Prediction encoding is repeatedly performed on one partition having asize of 2N_(—)0×2N_(—)0, two partitions having a size of 2N_(—)0×N_(—)0,two partitions having a size of N_(—)0×2N_(—)0, and four partitionshaving a size of N_(—)0×N_(—)0, according to each partition type. Theprediction encoding in an intra mode and an inter mode may be performedon the partitions having the sizes of 2N_(—)0×2N_(—)0, N_(—)0×2N_(—)0,2N_(—)0×N_(—)0, and N_(—)0×N_(—)0. The prediction encoding in a skipmode is performed only on the partition having the size of2N_(—)0×2N_(—)0.

If an encoding error is the smallest in one of the partition types 912through 916 having the sizes of 2N_(—)0×2N_(—)0, 2N_(—)0×N_(—)0, andN_(—)0×2N_(—)0, the prediction unit 910 may not be split into a lowerdepth.

If the encoding error is the smallest in the partition type 918 havingthe size of N_(—)0×N_(—)0, a depth is changed from 0 to 1 to split thepartition type 918 in operation 920, and encoding is repeatedlyperformed on partition type coding units having a depth of 2 and a sizeof N_(—)0×N_(—)0 to search for a minimum encoding error.

A prediction unit 940 for prediction encoding of the (partition type)coding unit 930 having a depth of 1 and a size of 2N_(—)1×2N_(—)1(=N_(—)0×N_(—)0) may include partitions of a partition type 942 having asize of 2N_(—)1×2N_(—)1, a partition type 944 having a size of2N_(—)1×N_(—)1, a partition type 946 having a size of N_(—)1×2N_(—)1,and a partition type 948 having a size of N_(—)1×N_(—)1.

If an encoding error is the smallest in the partition type 948 havingthe size of N_(—)1×N_(—)1, a depth is changed from 1 to 2 to split thepartition type 948 in operation 950, and encoding is repeatedlyperformed on coding units 960, which have a depth of 2 and a size ofN_(—)2×N_(—)2 to search for a minimum encoding error.

When a maximum depth is d, a split operation according to each depth maybe performed up to when a depth becomes d-1, and split information maybe encoded as up to when a depth is one of 0 to d-2. In other words,when encoding is performed up to when the depth is d-1 after a codingunit corresponding to a depth of d-2 is split in operation 970, aprediction unit 990 for prediction encoding a coding unit 980 having adepth of d-1 and a size of 2N_(d-1)×2N_(d-1) may include partitions of apartition type 992 having a size of 2N_(d-1)×2N_(d-1), a partition type994 having a size of 2N_(d-1)×N_(d-1), a partition type 996 having asize of N_(d-1)×2N_(d-1), and a partition type 998 having a size ofN_(d-1)×N_(d-1).

Prediction encoding may be repeatedly performed on one partition havinga size of 2N_(d-1)×2N_(d-1), two partitions having a size of2N_(d-1)×N_(d-1), two partitions having a size of N_(d-1)×2N_(d-1), fourpartitions having a size of N_(d-1)×N_(d-1) from among the partitiontypes 992 through 998 to search for a partition type having a minimumencoding error.

Even when the partition type 998 having the size of N_(d-1)×N_(d-1) hasthe minimum encoding error, since a maximum depth is d, a coding unitCU_(d-1) having a depth of d-1 is no longer split to a lower depth, anda coded depth for the coding units constituting the current maximumcoding unit 900 is determined to be d-1 and a partition type of thecurrent maximum coding unit 900 may be determined to be N_(d-1)×N_(d-1).Also, since the maximum depth is d, split information for the encodingunit 952 of the depth d-1 is not set.

A data unit 999 may be a ‘minimum unit’ for the current maximum codingunit. A minimum unit according to an embodiment of the present inventionmay be a rectangular data unit obtained by splitting the minimum codingunit 980 by 4. By performing the encoding repeatedly, the video encodingapparatus 100 may select a depth having the least encoding error bycomparing encoding errors according to depths of the coding unit 900 todetermine a coded depth, and set a corresponding partition type and aprediction mode as an encoding mode of the coded depth.

As such, the minimum encoding errors according to depths are compared inall of the depths of 1 through d, and a depth having the least encodingerror may be determined as a coded depth. The coded depth, the partitiontype of the prediction unit, and the prediction mode may be encoded andtransmitted as information about an encoding mode. Also, since a codingunit is split from a depth of 0 to a coded depth, only split informationof the coded depth is set to 0, and split information of depthsexcluding the coded depth is set to 1.

The entropy decoder 220 of the video decoding apparatus 200 may extractand use the information about the coded depth and the prediction unit ofthe coding unit 900 to decode the coding unit 912. The video decodingapparatus 200 may determine a depth, in which split information is 0, asa coded depth by using split information according to depths, and useinformation about an encoding mode of the corresponding depth fordecoding.

FIGS. 10 through 12 are diagrams for describing a relationship betweencoding units 1010, prediction units 1060, and transformation units 1070according to an embodiment of the present invention.

The coding units 1010 are coding units having a tree structure,corresponding to coded depths determined by the video encoding apparatus100, in a maximum coding unit. The prediction units 1060 are partitionsof prediction units of each of the coding units 1010, and thetransformation units 1070 are transformation units of each of the codingunits 1010.

When a depth of a maximum coding unit is 0 in the coding units 1010,depths of coding units 1012 and 1054 are 1, depths of coding units 1014,1016, 1018, 1028, 1050, and 1052 are 2, depths of coding units 1020,1022, 1024, 1026, 1030, 1032, and 1048 are 3, and depths of coding units1040, 1042, 1044, and 1046 are 4.

In the prediction units 1060, some coding units 1014, 1016, 1022, 1032,1048, 1050, 1052, and 1054 are obtained by splitting the coding units.In other words, partition types in the coding units 1014, 1022, 1050,and 1054 have a size of 2N×N, partition types in the coding units 1016,1048, and 1052 have a size of N×2N, and a partition type of the codingunit 1032 has a size of N×N. Prediction units and partitions of thecoding units 1010 are smaller than or equal to each coding unit.

Transformation or inverse transformation is performed on image data ofthe coding unit 1052 in the transformation units 1070 in a data unitthat is smaller than the coding unit 1052. Also, the coding units 1014,1016, 1022, 1032, 1048, 1050, 1052, and 1054 in the transformation units1070 are different from those in the prediction units 1060 in terms ofsizes and shapes. In other words, the video encoding apparatus 100 andthe video decoding apparatus 200 may perform intra prediction, motionestimation, motion compensation, transformation, and inversetransformation individually on a data unit in the same coding unit.

Accordingly, encoding is recursively performed on each of coding unitshaving a hierarchical structure in each region of a maximum coding unitto determine an optimum coding unit, and thus coding units having arecursive tree structure may be obtained. Encoding information mayinclude split information about a coding unit, information about apartition type, information about a prediction mode, and informationabout a size of a transformation unit. Table 1 shows the encodinginformation that may be set by the video encoding apparatus 100 and thevideo decoding apparatus 200.

TABLE 1 Split Information 0 (Encoding on Coding Unit having Size of 2N ×2N and Current Depth of d) Size of Partition Transformation Type UnitSplit Split Information Information Symmetrical Asymmetrical 0 of 1 ofSplit Prediction Partition Partition Transformation TransformationInformation Mode Type Type Unit Unit 1 Intra 2N × 2N 2N × nU 2N × 2N N ×N Repeatedly Inter 2N × N  2N × nD (Symmetrical Encode Skip  N × 2N nL ×2N Type) Coding (Only N × N nR × 2N N/2 × N/2 Units 2N × 2N)(Asymmetrical having Type) Lower Depth of d + 1

The entropy encoder 120 of the video encoding apparatus 100 may outputthe encoding information about the coding units having a tree structure,and the entropy decoder 220 of the video decoding apparatus 200 mayextract the encoding information about the coding units having a treestructure from a received bitstream.

Split information indicates whether a current coding unit is split intocoding units of a lower depth. If split information of a current depth dis 0, a depth, in which a current coding unit is no longer split into alower depth, is a coded depth, and thus information about a partitiontype, a prediction mode, and a size of a transformation unit may bedefined for the coded depth. If the current coding unit is further splitaccording to the split information, encoding is independently performedon four split coding units of a lower depth.

A prediction mode may be one of an intra mode, an inter mode, and a skipmode. The intra mode and the inter mode may be defined in all partitiontypes, and the skip mode is defined only in a partition type having asize of 2N×2N.

The information about the partition type may indicate symmetricalpartition types having sizes of 2N×2N, 2N×N, N×2N, and N×N, which areobtained by symmetrically splitting a height or a width of a predictionunit, and asymmetrical partition types having sizes of 2N×nU, 2N×nD,nL×2N, and nR×2N, which are obtained by asymmetrically splitting theheight or width of the prediction unit. The asymmetrical partition typeshaving the sizes of 2N×nU and 2N×nD may be respectively obtained bysplitting the height of the prediction unit in 1:n and n:1 (where n isan integer greater than 1), and the asymmetrical partition types havingthe sizes of nL×2N and nR×2N may be respectively obtained by splittingthe width of the prediction unit in 1:n and n:1.

The size of the transformation unit may be set to be two types in theintra mode and two types in the inter mode. In other words, if splitinformation of the transformation unit is 0, the size of thetransformation unit may be 2N×2N, which is the size of the currentcoding unit. If split information of the transformation unit is 1, thetransformation units may be obtained by splitting the current codingunit. Also, if a partition type of the current coding unit having thesize of 2N×2N is a symmetrical partition type, a size of atransformation unit may be N×N, and if the partition type of the currentcoding unit is an asymmetrical partition type, the size of thetransformation unit may be N/2×N/2.

The encoding information about coding units having a tree structure mayinclude at least one of a coding unit corresponding to a coded depth, aprediction unit, and a minimum unit. The coding unit corresponding tothe coded depth may include at least one of a prediction unit and aminimum unit containing the same encoding information.

Accordingly, it is determined whether adjacent data units are includedin the same coding unit corresponding to the coded depth by comparingencoding information of the adjacent data units. Also, a correspondingcoding unit corresponding to a coded depth is determined by usingencoding information of a data unit, and thus a distribution of codeddepths in a maximum coding unit may be determined.

Accordingly, if a current coding unit is predicted based on encodinginformation of adjacent data units, encoding information of data unitsin deeper coding units adjacent to the current coding unit may bedirectly referred to and used.

Alternatively, if a current coding unit is predicted based on encodinginformation of adjacent data units, data units adjacent to the currentcoding unit are searched using encoded information of the data units,and the searched adjacent coding units may be referred to for predictingthe current coding unit.

FIG. 13 is a diagram for describing a relationship between a codingunit, a prediction unit, and a transformation unit according to theencoding mode information of Table 1.

A maximum coding unit 1300 includes coding units 1302, 1304, 1306, 1312,1314, 1316, and 1318 of coded depths. Here, since the coding unit 1318is a coding unit of a coded depth, split information may be set to 0.Information about a partition type of the coding unit 1318 having a sizeof 2N×2N may be set to be one of a partition type 1322 having a size of2N×2N, a partition type 1324 having a size of 2N×N, a partition type1326 having a size of N×2N, a partition type 1328 having a size of N×N,a partition type 1332 having a size of 2N×nU, a partition type 1334having a size of 2N×nD, a partition type 1336 having a size of nL×2N,and a partition type 1338 having a size of nR×2N.

When the partition type is set to be symmetrical, i.e. the partitiontype 1322, 1324, 1326, or 1328, a transformation unit 1342 having a sizeof 2N×2N is set if split information (TU size flag) of a transformationunit is 0, and a transformation unit 1344 having a size of N×N is set ifa TU size flag is 1.

When the partition type is set to be asymmetrical, i.e., the partitiontype 1332, 1334, 1336, or 1338, a transformation unit 1352 having a sizeof 2N×2N is set if a TU size flag is 0, and a transformation unit 1354having a size of N/2×N/2 is set if a TU size flag is 1.

The TU size flag is a type of transformation index; a size of atransformation unit corresponding to a transformation index may bemodified according to a prediction unit type or a partition type of acoding unit.

When the partition type is set to be symmetrical, i.e. the partitiontype 1322, 1324, 1326, or 1328, the transformation unit 1342 having asize of 2N×2N is set if a TU size flag of a transformation unit is 0,and the transformation unit 1344 having a size of N×N is set if a TUsize flag is 1.

When the partition type is set to be asymmetrical, i.e., the partitiontype 1332 (2N×nU), 1334 (2N×nD), 1336 (nL×2N), or 1338 (nR×2N), thetransformation unit 1352 having a size of 2N×2N is set if a TU size flagis 0, and the transformation unit 1354 having a size of N/2×N/2 is setif a TU size flag is 1.

Referring to FIG. 13, the TU size flag described above is a flag havinga value of 0 or 1, but the TU size flag is not limited to 1 bit, and atransformation unit may be hierarchically split while the TU size flagincreases from 0. The transformation unit split information (TU sizeflag) may be used as an example of a transformation index.

In this case, when a TU size flag according to an embodiment is usedwith a maximum size and a minimum size of a transformation unit, thesize of the actually used transformation unit may be expressed. Thevideo encoding apparatus 100 may encode maximum transformation unit sizeinformation, minimum transformation unit size information, and maximumtransformation unit split information. The encoded maximumtransformation unit size information, minimum transformation unit sizeinformation, and maximum transformation unit split information may beinserted into a sequence parameter set (SPS). The video decodingapparatus 200 may use the maximum transformation unit size information,the minimum transformation unit size information, and the maximumtransformation unit split information for video decoding.

For example, (a) if a size of a current coding unit is 64×64 and amaximum transformation unit is 32×32, (a-1) a size of a transformationunit is 32×32 if a TU size flag is 0; (a-2) a size of a transformationunit is 16×16 if a TU size flag is 1; and (a-3) a size of atransformation unit is 8×8 if a TU size flag is 2.

Alternatively, (b) if a size of a current coding unit is 32×32 and aminimum transformation unit is 32×32, (b-1) a size of a transformationunit is 32×32 if a TU size flag is 0, and since the size of atransformation unit cannot be smaller than 32×32, no more TU size flagsmay be set.

Alternatively, (c) if a size of a current encoding unit is 64×64 and amaximum TU size flag is 1, a TU size flag may be 0 or 1 and no other TUsize flags may be set.

Accordingly, when defining a maximum TU size flag as‘MaxTransformSizeIndex’, a minimum TU size flag as ‘MinTransformSize’,and a transformation unit in the case when a TU size flag is 0, that is,a basic transformation unit RootTu as ‘RootTuSize’, a size of a minimumtransformation unit ‘CurrMinTuSize’, which is available in a currentcoding unit may be defined by Equation (1) below.

CurrMinTuSize=max(MinTransformSize,RootTuSize/(2̂MaxTransformSizeIndex))  (1)

In comparison with the size of the minimum transformation unit‘CurrMinTuSize’ that is available in the current coding unit, the basictransformation unit size ‘RootTuSize’, which is a size of atransformation unit when if a TU size flag is 0, may indicate a maximumtransformation unit which may be selected in regard to a system. Thatis, according to Equation (1), ‘RootTuSize/(2̂MaxTransformSizeIndex)’ isa size of a transformation unit that is obtained by splitting‘RootTuSize’, which is a size of a transformation unit whentransformation unit split information is 0, by the number of splittingtimes corresponding to the maximum transformation unit splitinformation, and ‘MinTransformSize’ is a size of a minimumtransformation unit, and thus a smaller value of these may be‘CurrMinTuSize’ which is the size of the minimum transformation unitthat is available in the current coding unit.

The size of the basic transformation unit ‘RootTuSize’ according to anembodiment of the present invention may vary according to a predictionmode.

For example, if a current prediction mode is an inter mode, RootTuSizemay be determined according to Equation (2) below. In Equation (2),‘MaxTransformSize’ refers to a maximum transformation unit size, and‘PUSize’ refers to a current prediction unit size.

RootTuSize=min(MaxTransformSize, PUSize)  (2)

In other words, if a current prediction mode is an inter mode, the sizeof the basic transformation unit size ‘RootTuSize’, which is atransformation unit if a TU size flag is 0, may be set to a smallervalue from among the maximum transformation unit size and the currentprediction unit size.

If a prediction mode of a current partition unit is an intra mode,‘RootTuSize’ may be determined according to Equation (3) below.‘PartitionSize’ refers to a size of the current partition unit.

RootTuSize=min(MaxTransformSize, PartitionSize)  (3)

In other words, if a current prediction mode is an intra mode, the basictransformation unit size ‘RootTuSize’ may be set to a smaller value fromamong the maximum transformation unit size and the current partitionunit size.

However, it should be noted that the size of the basic transformationunit size ‘RootTuSize’, which is the current maximum transformation unitsize according to an embodiment of the present invention and variesaccording to a prediction mode of a partition unit, is an example, andfactors for determining the current maximum transformation unit size arenot limited thereto.

Hereinafter, an entropy encoding operation of a syntax element, which isperformed in the entropy encoder 120 of the video encoding apparatus 100of FIG. 1, and an entropy decoding operation of a syntax element, whichis performed in the entropy decoder 220 of the video decoding apparatus200 of FIG. 2 will be described in detail.

As described above, the video encoding apparatus 100 and the videodecoding apparatus 200 perform encoding and decoding by splitting amaximum coding unit into coding units that are smaller than or equal toa maximum coding unit. A prediction unit and a transformation unit usedin prediction and transformation may be determined based on costsindependently from other data units. Since an optimum coding unit may bedetermined by recursively encoding each coding unit having ahierarchical structure included in the maximum coding unit, data unitshaving a tree structure may be configured. In other words, for eachmaximum coding unit, a coding unit having a tree structure, and aprediction unit and a transformation unit each having a tree structuremay be configured. For decoding, hierarchical information, which isinformation indicating structure information of data units having ahierarchical structure and non-hierarchical information for decoding,besides the hierarchical information, needs to be transmitted.

The information related to a hierarchical structure is informationneeded for determining a coding unit having a tree structure, aprediction unit having a tree structure, and a transformation unithaving a tree structure, as described above with reference to FIGS. 10through 12, and includes a transformation unit split flag (TU size flag)indicating a size of a maximum coding unit, coded depth, partitioninformation of a prediction unit, a split flag indicating whether acoding unit is split or not, size information of a transformation unit,and a transformation unit split flag (TU size flag) indicating whether atransformation unit is split or not. Examples of coding informationother than hierarchical structure information include prediction modeinformation of intra/inter prediction applied to each prediction unit,motion vector information, prediction direction information, colorcomponent information applied to each data unit in the case when aplurality of color components are used, and transformation coefficientinformation. Hereinafter, hierarchical information andextra-hierarchical information may be referred to as a syntax elementwhich is to be entropy encoded or entropy decoded.

In particular, according to the embodiments of the present invention, amethod of determining a context model for efficiently entropy encodingand decoding a level of a transformation coefficient, that is, sizeinformation of syntax elements is provided. Hereinafter, a method ofdetermining a context model for entropy encoding and decoding of a levelof a transformation coefficient will be described in detail.

FIG. 14 is a flowchart illustrating an operation of entropy encoding anddecoding of transformation coefficient information included in atransformation unit, according to an embodiment of the presentinvention.

Referring to FIG. 14, coded_block_flag indicating whether atransformation coefficient which is not 0 (hereinafter, referred to as“significant coefficient”) exists or not from among transformationcoefficients included in a current transformation unit is first entropyencoded or decoded in operation 1410.

If coded_block_flag is 0, there are only transformation coefficients of0 in the current transformation unit, and thus only a value 0 is entropyencoded or decoded as coded_block_flag, and transformation coefficientlevel information is not entropy encoded or decoded.

In operation 1420, if there is a significant coefficient in the currenttransformation unit, a significance map SigMap indicating a location ofa significant coefficient is entropy encoded or decoded.

A significance map SigMap may be formed of a significant bit andpredetermined information indicating a location of a last significancecoefficient. A significant bit indicates whether a transformationcoefficient according to each scan index is a significant coefficient or0, and may be expressed by significant_coeff_flag[i]. As will bedescribed later, a significance map is set in units of subsets having apredetermined size which is obtained by splitting a transformation unit.Accordingly, significant_coeff_flag[i] indicates whether atransformation coefficient of an i-th scan index from amongtransformation coefficients included in a subset included in atransformation unit is 0 or not.

According to the conventional H.264, a flag (End-Of-Block) indicatingwhether each significant coefficient is the last significant coefficientor not is additionally entropy encoded or decoded. However, according toan embodiment of the present invention, location information of the lastsignificant coefficient itself is entropy encoded or decoded. Asdescribed above with reference to FIGS. 1 through 13, the size of atransformation unit according to an embodiment of the present inventionis not limited to 4×4 but may also be a larger size such as 8×8, 16×16,or 32×32. It is inefficient to additionally entropy encode or decode aflag (End-Of-Block) indicating whether each significant coefficient isthe last significant coefficient since the size of the flag(End-Of-Block) increases. Accordingly, according to an embodiment of thepresent invention, the location information of the last significantcoefficient itself may be entropy coded or decoded. For example, if alocation of the last significant coefficient is (x, y), where x and yare integers, coordinate values of (x, y) may be entropy encoded ordecoded.

FIG. 15 illustrates subsets obtained by splitting a transformation unitaccording to an embodiment of the present invention. While a 16×16 sizeof a transformation unit 1500 is illustrated in FIG. 15, the size of thetransformation unit 1500 is not limited to 16×16 and may also be from4×4 to 32×32.

Referring to FIG. 15, for entropy encoding and decoding of atransformation coefficient included in the transformation unit 1500, thetransformation unit 1500 is split into subsets having a predeterminedsize. For example, as illustrated in FIG. 15, the transformation unit1500 may be split into subsets having a 4×4 size. The size of thesubsets is not limited to 4×4 and may be various.

As illustrated in FIG. 15, the transformation unit 1500 is split into 16subsets, and transformation coefficient information is entropy encodedor decoded for each subset unit. The transformation coefficientinformation that is entropy encoded or decoded may be, for example, afirst critical value flag (Greaterthan1 flag, hereinafter referred to as“Gtr1 flag” indicating whether a significant coefficient included in asubset has a greater value than a predetermined first critical valuesuch as 1, a second critical value flag (Greaterthan2 flag, hereinafterreferred to as “Gtr2 flag”) indicating whether a significant coefficientincluded in a subset has a greater value than a predetermined secondcritical value such as 2, or level-3 indicating a level of a significantcoefficient that is greater than a predetermined second critical value.Here, it is assumed that the first critical value is set to 1, and thesecond critical value is set to 2, but the first and second criticalvalues are not limited thereto and may be modified. The first criticalvalue flag (Gtr1 flag) is set only for a transformation coefficientincluding a significance map of 1, that is, only for a significantcoefficient, and is not set for a transformation coefficient of 0. Inaddition, the second critical value flag (Gtr2 flag) is set only for atransformation coefficient having a first critical value flag of 1.

In FIG. 15, a subset including the last significant coefficient isassumed as a subset 11 1510. Entropy encoding and decoding oftransformation coefficient information performed for each subset isperformed backwards according to a scanning order from the subset 111510 in which the last significant coefficient is included.

FIG. 16 illustrates a subset 1600 included in the transformation unit1500 of FIG. 15, according to an embodiment of the present invention.

Referring to FIGS. 15 and 16, any subset included in the transformationunit 1500 illustrated in FIG. 15 is assumed as including transformationcoefficients having a 4×4 size as illustrated in FIG. 16. As describedabove, according to an embodiment of the present invention, asignificance map SigMap, a first critical value flag Gtr1 flag, and asecond critical value flag Gtr2 flag are entropy encoded or decoded inunits of subsets.

FIG. 17 illustrates a significance map SigMap 1700 corresponding to thesubset 1600 of FIG. 16.

Referring to FIGS. 16 and 17, the significance map SigMap 1700 having avalue of 1 with respect to significant coefficients from amongtransformation coefficients which are included in the subset 1600 ofFIG. 16 and which do not have a value of 0 is set. The significance mapSigMap 1700 is entropy encoded or decoded using a predetermined contextmodel.

FIG. 18 illustrates a first critical value flag Gtr1 flag 1800corresponding to the subset 1600 of FIG. 16.

Referring to FIGS. 16 through 18, the first critical value flag Gtr1flag 1800 indicating whether a corresponding transformation coefficienthas a value greater than a first critical value, that is, 1, withrespect to transformation coefficients having a value of 1 is set. Ifthe critical value flag Gtr1 flag 1800 is 1, this indicates that acorresponding transformation coefficient has a value greater than 1, andif the critical value flag Gtr1 flag 1800 is 0, this indicates that acorresponding coefficient has a value of 1.

FIG. 19 illustrates a second critical value flag Gtr2 flag 1900corresponding to the subset 1600 of FIG. 16.

Referring to FIGS. 16 through 19, the second critical value flag Gtr2flag 1900 is set only with respect to a transformation coefficient forwhich the first critical value flag Gtr1 flag 1800 is set as 1 amongtransformation coefficients, and the second critical value flag Gtr2flag 1900 indicates whether a corresponding transformation coefficienthas a value greater than a second critical value, that is, 2.

Referring to FIG. 19, level information of a transformation coefficienthaving a second critical value flag Gtr2 flag 1900 of 1, that is,transformation coefficients having a value of 23 and 3 in FIG. 16,itself is entropy encoded or decoded. Here, a transformation coefficientincluding a second critical value flag Gtr2 flag 1900 of 1 is atransformation coefficient that has a greater value than 2, and thus, avalue obtained by subtracting 3 from a level of a correspondingtransformation coefficient (level-3) is encoded as level information ofa corresponding transformation coefficient. In the above-describedexample, when performing entropy encoding, 20 is encoded instead of 23,and 0 is encoded as level information instead of 3. When performingentropy decoding, level information of a transformation coefficient maybe restored by entropy decoding level-3 of a transformation coefficientincluding the second critical value flag Gtr2 flag 1900 of 1, and thenadding 3 thereto.

FIG. 20 is a table showing transformation coefficients included in thesubsets illustrated in FIGS. 16 through 19 and transformationcoefficient information that is entropy encoded or decoded. As describedabove, according to an embodiment of the present invention, asignificance map SigMap, a first critical value flag Gtr1 flag, a secondcritical value flag Gtr2 flag, and level information (level-3)indicating a location and level information of a significant coefficientare entropy encoded or decoded according to a predetermined scanningorder from the last significant coefficient in units of subsets.

FIG. 21A is a structural block diagram illustrating a structure of anentropy encoding apparatus 2100 according to an embodiment of thepresent invention. The entropy encoding apparatus 2100 of FIG. 21Acorresponds to the entropy encoder 120 of the video encoding apparatus100 of FIG. 1.

Referring to FIG. 21A, the entropy encoding apparatus 2100 includes abinarizer 2110, a context modeler 2120, and a binary arithmetic coder2130. Also, the binary arithmetic coder 2130 includes a regular codingengine 2132 and a bypass coding engine 2134.

Syntax elements that are input to the entropy encoding apparatus 2100may not be binary values, and thus, if the syntax elements are notbinary values, the binarizer 2110 binarizes the syntax elements tooutput a bin string consisting of binary values of 0 or 1. Bin denoteseach bit of a stream consisting of 0 or 1, and is encoded by contextadaptive binary arithmetic coding (CABAC). If a syntax element is dataincluding 0 and 1 at the same frequencies, the syntax element is outputto the bypass coding engine 2134, which does not use a probabilityvalue.

The context modeler 2120 provides a probability model about a currentencoding symbol to the regular coding engine 2132. In detail, thecontext modeler 2120 outputs probabilities of binary values for encodingbinary values of a current encoding symbol to the binary arithmeticcoder 2130. A current encoding symbol refers to each binary value whenan encoded current syntax element is formed by binarization, that is,when formed of a binary value.

A context model is a probability model with respect to a bin, andincludes information indicating which of 0 and 1 corresponds to a mostprobable symbol (MPS) and a least probable symbol (LPS) and aprobability of an MPS or an LPS. Hereinafter, a context model may besimply referred to as a context. Also, a context set refers to a setincluding a plurality of contexts.

The regular coding engine 1432 performs binary arithmetic encoding on acurrent encoding symbol based on information about an MPS or an LPS andprobability information of the MPS or the LPS provided by from thecontext modeler 1420.

As will be described later, in order to determine a context model forentropy encoding of a first critical value flag Gtr1 flag of a subset,the context modeler 2120 according to an embodiment of the presentinvention obtains a context set index ctxset for determining a contextset used in entropy encoding of the first critical value flag from amonga plurality of context sets including a plurality of contexts, based oncolor component information of a transformation unit, locationinformation of a subset indicating at which location a current subset islocated in the transformation unit, and whether there are significantcoefficients having a value greater than the first critical value in asubset that is processed before the current subset according to theorder of processing describe with respect to FIG. 15 above. In addition,the context modeler 2120 obtains a context offset c1 for determining oneof a plurality of contexts included in a context set used in entropyencoding of the first critical value flag Gtr1 flag based on a length ofa previous transformation coefficient having consecutive 1s. Also, thecontext modeler 2120 obtains a context index ctxIdx1 indicating acontext used in entropy encoding of the first critical value flag Gtr1flag by using the context set index ctxset and the context offset c1.When always entropy encoding or decoding the first critical value flagGtr1 flag of a transformation unit, a value of the context offset c1 ismaintained or modified, and thus, the context is always maintained orupdated every time when entropy encoding or decoding the first criticalvalue flag Gtr1 flag of the transformation unit.

Also, the context modeler 2120 obtains a context set index ctxsetindicating one of a plurality of context sets used in entropy encodingand decoding of a second critical value flag Gtr2 flag based on colorcomponent information of a transformation unit, location information ofa subset indicating at which location a current subset is located in thetransformation unit, and whether there are significant coefficientshaving a value greater than the first critical value in a subset that isprocessed before the current subset according to the order of processingdescribed with respect to FIG. 15 above. A parameter used to obtain thecontext set index ctxset used in entropy encoding and decoding of thesecond critical value flag Gtr2 flag is the same as a parameter used toobtain the context set index ctxset that is used in entropy encoding anddecoding of the first critical value flag Gtr1 flag. Accordingly, thecontext modeler 2120 may use the context set index ctxset fordetermining a context set used in entropy encoding of the first criticalvalue flag described above also when determining a context set forentropy encoding of the second critical value flag Gtr2 flag. A contextoffset c2 for determining one of a plurality of contexts included in acontext set that is used in entropy encoding of the second criticalvalue flag Gtr2 flag has a value of 0. Accordingly, a context indexctxIdx2 indicating a context used in entropy encoding of the secondcritical value flag Gtr2 flag is set to be the same as the context setindex ctxset of the first critical value flag Gtr1 flag.

FIG. 21B is a structural block diagram illustrating a structure of anentropy decoding apparatus 2150 according to an embodiment of thepresent invention. The entropy decoding apparatus 2150 of FIG. 21Bcorresponds to the entropy decoder 220 of the video decoding apparatus200 of FIG. 2. The entropy decoding apparatus 2150 performs a reverseoperation of entropy encoding performed in the entropy encodingapparatus 2100 described above.

Referring to FIG. 21B, the entropy decoding apparatus 2150 includes acontext modeler 2160, a regular decoding engine 2170, a bypass decoder2180, and a de-binarizer 2190.

A symbol that is encoded by bypass encoding is output to the bypassdecoder 2180 to be decoded, and a symbol encoded by regular encoding isdecoded by the regular decoding engine 2170. The regular decoding engine2170 arithmetically decodes a binary value of a current encoding symbolbased on a context model provided by the context modeler 2160.

The context modeler 2160 determines a context model for entropy decodingof a first critical value flag Gtr1 flag and a second critical valueflag Gtr2 flag in the same manner as the context modeler 2120 of FIG.21A described above. The context modeler 2160 of FIG. 21B determines acontext model for entropy decoding of a first critical value flag Gtr1flag and a second critical value flag Gtr2 flag in the same manner asthe context modeler 2120 of FIG. 21A described above with respect toencoding except that it operates with respect to decoding.

The de-binarizer 2340 restores bin strings that are restored in theregular decoding engine 2170 or the bypass decoder 2180 to syntaxelements.

Hereinafter, an operation of determining a context model for entropyencoding and decoding of a first critical value flag Gtr1 flag and asecond critical value flag Gtr2 flag conducted by using the contextmodelers 2120 and 2160 of FIGS. 21A and 21B will be described in detail.

FIG. 22 is a structural block diagram illustrating a context modeler2200 according to an embodiment of the present invention.

Referring to FIG. 22, the context modeler 2200 includes a mapping unit2210, a context set obtaining unit 2220, a context offset obtaining unit2230, and a context determining unit 2240.

The mapping unit 2210 obtains location information of a significantcoefficient included in a current subset that is entropy encoded ordecoded. When performing entropy encoding, the mapping unit 2210 mayobtain locations of significant coefficients from information oftransformation coefficients included in the current subset. Whenperforming entropy decoding, the mapping unit 2210 may obtain locationsof significant coefficients included in a subset from a significance mapSigMap.

The context set obtaining unit 2220 obtains a context set index ctxsetindicating one of a plurality of context sets including a plurality ofcontexts which are used in entropy encoding and decoding of a firstcritical value flag Gtr1 flag and a second critical value flag Gtr2 flagregarding a significant coefficient.

In detail, the context set obtaining unit 2220 obtains a context setindex for determining a context model used in entropy encoding anddecoding of a first critical value flag Gtr1 flag and a second criticalvalue flag Gtr2 flag based on color component information of atransformation unit, location information of a current subset that isbeing processed, and whether there are significant coefficients having avalue greater than a first critical value in a subset that is processedbefore the current subset.

The context offset obtaining unit 2230 determines a context offsetindicating one of a plurality of contexts included in the context setindex ctxset. When performing entropy encoding or decoding on the firstcritical value flag Gtr1 flag, the context offset c1 may be determinedbased on a length of a previous transformation coefficient havingconsecutive 1s before a current significant coefficient that isprocessed while significant coefficients included in the current subsetare being processed according to a predetermined scanning order. Acontext offset c2 of the second critical value flag Gtr2 flag has avalue of 0.

The context determining unit 2240 obtains a context index ctxIdx1indicating a context used in entropy encoding or decoding of a firstcritical value flag Gtr1 flag by using the context set index ctxset andthe context offset c1. For entropy encoding and decoding of the firstcritical value flag Gtr1 flag, when it is assumed that n context sets (nis an integer) are set, and n context sets have m contexts (m is aninteger), a total of n*m contexts may be used in entropy encoding ordecoding of a first critical value flag Gtr1 flag. When assuming that acontext set index ctxSet indicating one of n context sets is an integerfrom 0 to (n-1) and a context offset c1 indicating one of m contextoffsets is an integer from 0 to (m-1), a context index ctxIdx1indicating one of n*m contexts may be determined based on the followingequation: ctxIdx1=ctxSet*m+c1.

A context index ctxIdx2 indicating one context used in entropy encodingand decoding of a second critical value flag Gtr2 flag may be determinedbased on the following equation: ctxIdx2=ctxSet*1+c2. As describedabove, since c2 is 0, the context index ctxIdx2 indicating a contextused in entropy encoding and decoding of the second critical value flagGtr2 flag is determined based on only a value of a set index ctxSet.

FIG. 23 illustrates a plurality of context sets applied to atransformation unit of a luminance component and a plurality of contextsincluded in each context set according to an embodiment of the presentinvention. FIG. 24 illustrates a plurality of context sets applied to atransformation unit of a chrominance component and a plurality ofcontexts included in each context set according to an embodiment of thepresent invention.

Referring to FIG. 23, for entropy encoding and decoding of a firstcritical value flag Gtr1 flag with respect to a transformationcoefficient included in a luminance component subset, a context includedin any one of a first context set 2310, a second context set 2320, athird context set 2330, and a fourth context set 2340 is used. Differentcontexts that are included in any one of the four context sets may bedistinguished by a context offset c1 from 0 to 3, as illustrated in FIG.23. Accordingly, a context from among the total of 16 contexts that areto be applied to the first critical value flag Gtr1 flag of atransformation unit of a subset of a luminance component may bedetermined by using a context set index ctxset and a context offset c1.That is, a context index ctxIdx1 indicating one of the total of 16contexts that are to be applied to the first critical value flag Gtr1flag of a transformation unit of a subset of a luminance component maybe determined by the following equation: ctxIdx1=ctxSet*4+c1.

Similarly, when referring to FIG. 24, for entropy encoding and decodingof a first critical value flag Gtr1 flag with respect to atransformation unit included in a chrominance component subset, acontext included in any one of context sets from a total of two contextsets, a first context set 2410 and a second context set 2420 is used.Different contexts included in a context set may be distinguished by acontext offset c1_chroma from 0 to 3 as illustrated in FIG. 24. As willbe described later, a context offset c1_chroma regarding a chrominancecomponent may be set in the same manner as the context offset c1 withrespect to a luminance component. A context index ctxIdx1_chromaindicating one of a total of 8 contexts that are to be applied to afirst critical value Gtr1 flag of a transformation coefficient includedin a subset of a chrominance component may be determined by thefollowing equation: ctxIdx1_chroma=ctxSet*4+c1_chroma.

FIG. 27A illustrates a context set index ctxset for determining acontext set used in entropy encoding and decoding of a first criticalvalue flag Gtr1 flag and a second critical value flag Ctr2 flag of asignificant coefficient of a luminance component and a significantcoefficient of a chrominance component according to an embodiment of thepresent invention.

Referring to FIG. 27A, the context set obtaining unit 2220 obtains acontext set index ctxset for determining a context set used in entropyencoding and decoding of a first critical value flag Gtr1 flag and asecond critical value flag Gtr2 flag based on color componentinformation of a transformation unit, location information of a currentsubset that is being processed, and whether there are significantcoefficients having a value greater than a first critical value in asubset that is processed before the current subset.

For example, regarding a transformation coefficient included in atransformation unit of a luminance component, if there is no significantcoefficient having a value greater than 1, in a previously processedsubset (NoGreatT1), and when entropy encoding or decoding a firstcritical value flag of a significant coefficient included in a subsetlocated at the upper side of the leftmost location, a first context set(ctxset=0) is obtained. If there is no significant coefficient having avalue greater than 1, in a previously processed subset (at least oneGreatT1), and when entropy encoding or decoding a first critical valueflag of a significant coefficient included in a subset (subset 0)located at the upper side of the leftmost location, a second context set(ctxset=1) is obtained. If there is no significant coefficient having avalue greater than 1, in a previously processed subset (No GreatT1), andwhen entropy encoding or decoding a first critical value flag of asignificant coefficient included in a subset (other subsets) that is notlocated at the upper side of the leftmost location, a third context set(ctxset=2) is obtained. Also, if there is a significant coefficienthaving a value greater than 1, in a previously processed subset (atleast one GreatT1), and when entropy encoding or decoding a firstcritical value flag of a significant coefficient included in a subset(other subsets) that is not located at the upper side of the leftmostlocation, a fourth context set (ctxset=3) is obtained.

Regarding a transformation coefficient included in a transformation unitof a chrominance component, a context set is obtained based only onwhether there is a significant coefficient having a value greater than 1in a previously processed subset. In other words, if there is nosignificant coefficient having a value greater than 1 in a previouslyprocessed subset (No GreatT1), a first context set (ctxset=0) isobtained, and if there is a significant coefficient having a valuegreater than 1 in a previously processed subset (at least one GreatT1),a second context set (ctxset=1) is obtained.

FIG. 27B illustrates a context offset used in entropy encoding anddecoding of a first critical value flag Gtr1 flag and a second criticalvalue flag Gtr2 flag according to an embodiment of the presentinvention.

Referring to FIG. 27B, in order to determine a context included in acontext set used in entropy encoding and decoding of a first criticalvalue flag Gtr1 flag, a first context offset (c1=0) is obtained forsignificant coefficients having a value greater than 1 from amongsignificant coefficients included in a subset. Regarding a significantcoefficient that is processed the first time from among significantcoefficients included in a subset, a second context offset (c1=1) isobtained. If a length of a previous transformation coefficient havingconsecutive 1s is 1, a third context offset (c1=2) is obtained. If alength of a previous transformation coefficient having consecutive 1s is2 or more, a fourth context offset (c1=3) is obtained. A context offsetc1 may be applied to both a luminance component and a chrominancecomponent.

A context offset c2 used in entropy encoding and decoding of a secondcritical value flag Gtr2 flag has a value of 0.

When a context set index ctxset and a context offset c1 or c2 forentropy encoding or decoding of a transformation coefficient included ina subset is determined based on tables of FIGS. 27A and 27B, the contextdetermining unit 2240 determines a context index ctxIdx1 indicating oneof the total of 16 contexts that are to be applied to the first criticalvalue flag Gtr1 flag of a transformation unit included in a subset of aluminance component according to the following equation:ctxIdx1=ctxSet*4+c1. Also, the context determining unit 2240 determinesa context index ctxIdx1_chroma indicating one of the total of 8 contextsthat are to be applied to the first critical value flag Gtr1 flag of atransformation unit included in a subset of a chrominance componentaccording to the following equation: ctxIdx1_chroma=ctxSet*4+c1.Referring to the tables of FIGS. 27A and 27B, a total of contextsapplied to the first critical value flag Gtr1 flag is 24, that is,4×4=16 for a luminance component plus 2×4=8 for a chrominance component.

Also, the context determining unit 2240 determines a context indexctxIdx2 indicating a context to be applied to the second critical valueflag Gtr2 flag according to the following equation: ctxIdx2=ctxset*1+c2.That is, a context index ctxIdx2 indicating a context to be applied to asecond critical value flag Gtr2 flag is set to be the same as a value ofa context set index ctxset. Accordingly, referring to the tables ofFIGS. 27A and 27B, a total of contexts applied to the first criticalvalue flag Gtr1 flag is 8, that is, 4 for a luminance component and 4for a chrominance component.

FIG. 25 is a flowchart illustrating a method of determining a contextmodel for entropy encoding and decoding of a transformation coefficientlevel, according to an embodiment of the present invention.

Referring to FIG. 25, in operation 2510, the mapping unit 2210 splits atransformation unit into subsets having a predetermined unit and obtainsa significant coefficient that is included in each subset and is not 0.As described above, when performing entropy encoding, the mapping unit2210 may obtain a location of a significant coefficient from informationof transformation coefficients included in a current subset. Whenperforming entropy decoding, the mapping unit 2210 may obtain a locationof a significant coefficient included in a subset from a significancemap SigMap.

In operation 2520, the context set obtaining unit 2220 obtains a contextset index ctxset for determining a context set used in entropy encodingand decoding of a first critical value flag indicating whether asignificant coefficient from among a plurality of context sets includinga plurality of contexts has a value greater than a first critical value,based on color component information of a transformation unit, alocation of a first subset that includes a significant coefficient andis currently processed, and whether there is a significant value greaterthan a predetermined first critical value in a second subset that isprocessed before the first subset. As shown in FIGS. 27A and 27B, thecontext set obtaining unit 2220 may obtain a context set ctxsetindicating a context set from among four context sets depending onwhether the location of a first subset is a subset0 located at the upperside of the leftmost position, and whether there is a significantcoefficient having a value greater than 1 in a previously processedsubset, regarding a transformation coefficient included in atransformation unit of a luminance component. Also, the context setobtaining unit 2220 may obtain a context set ctxset indicating one oftwo context sets based only on whether there is a significantcoefficient having a value greater than 1 in a previously processedsubset, with respect to a transformation coefficient included in atransformation unit of a chrominance component.

In operation 2530, the context offset obtaining unit 2230 obtains acontext offset for determining one of a plurality of contexts includedin a context set used in entropy encoding or decoding of the firstcritical value flag Gtr1 flag based on a length of a previoustransformation coefficient having consecutive 1s. As described above,the context offset obtaining unit 2230 may determine a context offset c1based on a length of the previous transformation coefficient consecutive1s before a current significant coefficient that is processed whilesignificant coefficients included in the current subset are beingprocessed according to a predetermined scanning order. The contextoffset obtaining unit 2230 may set a context offset c2 which is used inentropy encoding and decoding of a second critical value flag Gtr2 flag,to always have a value of 0 without having to consider other parameters.

In operation 2540, the context determining unit 2240 obtains a contextindex ctxIdx1 indicating a context used in entropy encoding and decodingof a first critical value flag Gtr1 flag by using a context set indexctxset and a context offset c1. As described above, when assuming that acontext set index ctxSet indicating one of n context sets are integersfrom 0 to (n-1), and a context offset c1 indicating one of m contextoffsets is an integer from 0 to (m-1), the context determining unit 2240may determine a context index ctxIdx1 indicating one of n*m contextsaccording to the following equation: ctxIdx1=ctxSet*m+c1. Also, thecontext determining unit 2240 may determine a context index ctxIdx2indicating one context used in entropy encoding and decoding of a secondcritical value flag Gtr2 flag according to the following equation:ctxIdx2=ctxSet*1+c2. Since c2 is 0, the context index ctxIdx2 indicatinga context used in entropy encoding and decoding of the second criticalvalue flag Gtr2 flag is determined based on only a value of a set indexctxSet.

FIG. 26 is a detailed flowchart illustrating a method of determining acontext model for entropy encoding and decoding of a transformationcoefficient level, according to an embodiment of the present invention.

Referring to FIG. 26, in operation 2611, the mapping unit 2210 splits atransformation unit into subsets having a predetermined size and obtainsa significant coefficient that is included in each subset and is not 0.

In operation 2612, the context set obtaining unit 2220 determines acontext set index ctxset based on a location of a current subset andcolor component information from among parameters that are used indetermining a context set. For example, when a current subset is asubset0 located at the upper side of the leftmost position of atransformation unit or a chrominance component, ctxset is set to 0, andif the current subset is not a subset located at the upper side of theleftmost position of a transformation unit and is a luminance component,ctxset is set to 2.

In operation 2613, the context set obtaining unit 2220 determineswhether there is a significant coefficient having a value greater than afirst critical value in a previous subset that is processed just beforea current subset. As a result of determining in operation 2613, if thereis a significant coefficient having a value greater than a firstcritical value in a previous subset, the context set obtaining unit 2220increases a value of the context set index ctxset set in operation 2612by 1; otherwise, if there is no significant coefficient having a valuegreater than the first critical value in a previous subset, the contextset obtaining unit 2200 maintains the value of the context set indexctxset set in operation 2612.

In operation 2615, the context offset obtaining unit 2230 sets a valueof the context offset c1 to be applied to a first critical value flagGtr1 flag of a significant coefficient of a current subset that isprocessed the first time.

In operation 2616, the context determining unit 2240 obtains a contextindex ctxIdx1 indicating a context to be applied to a first criticalvalue flag Gtr1 flag according to the following equation:ctxIdx1=ctxset*4+min(c1, 3). The equation is based on FIGS. 27A and 27B,and a length of a transformation coefficient having consecutive 1s isdetermined in operation 2620, and the context offset index c1 may have avalue of 3 or greater if at least three first critical value flags Gtr1flag continue. However, referring to FIG. 27B, as a context offset indexc1 is set to have a value of 2 for at least two consecutive 1s,min(c1,3) is used to limit the context offset index c1 to not be greaterthan 3.

In operation 2617, the regular coding engine 2132 and the regulardecoding engine 2170 entropy encode or decode the first critical valueflag Gtr1 flag based on a context model indicated by the obtainedcontext index ctxIdx1.

In operation 2618, the context offset obtaining unit 2230 determineswhether a first critical value flag Gtr1 flag, which is currentlyencoded or decoded, has a value of 0 and whether a context offset indexc1 is 0 or not. The determination operation of operation 2618 isperformed in order to determine the number of consecutive 1s from amongsignificant coefficients. As a result of determining in operation 2618,if a currently encoded or decoded first critical value flag Gtr1 flaghas a value of 0 and the context offset index c1 is not 0, the contextoffset index c1 is increased by 1 in operation 2620. As a result ofdetermining of operation 2618, otherwise, if the currently encoded ordecoded first critical value flag Gtr1 flag does not have a value of 0or the context offset index c1 is 0, the context offset index c1 isreset to 1 in operation 2619.

Operations 2615 through 2620 are operations of entropy encoding ordecoding of a first critical value flag Gtr1 flag of a transformationcoefficient included in a subset. To speed up the operations, instead ofentropy encoding or decoding a first critical value flag Gtr1 flagregarding every significant coefficient, only a first critical valueflag Gtr1 flag may be entropy encoded or decoded for only apredetermined number (#) of significant coefficients from the lastsignificant coefficient (max loop #). Level information of a significantcoefficient whose corresponding first critical value flag Gtr1 flag isnot entropy encoded or decoded is itself entropy encoded or decoded.

In operations 2621 through 2624, a second critical value flag Gtr2 flagis entropy encoded or decoded.

In operation 2621, whether a value of a context offset index c1 is 0 ornot is determined. In operation 2622, a context index ctxIdx2 indicatinga context to be applied for entropy encoding or decoding of a secondcritical value flag Gtr2 flag is set to be the same as the context setindex ctxset that is determined when entropy encoding and decoding ofthe first critical value flag Gtr1 flag is performed.

In operation 2623, the regular coding engine 2132 and the regulardecoding engine 2170 entropy encode or decode a second critical valueflag Gtr2 flag based on a context model indicated by the obtainedcontext index ctxIdx2.

In operation 2624, the regular coding engine 2132 and the regulardecoding engine 2170 entropy encode or decode levels of transformationcoefficients having a value greater than a second critical value. Asdescribed above, a level value (level-3) obtained by subtracting apredetermined value from a corresponding transformation coefficient maybe entropy encoded or decoded.

FIG. 28 illustrates a table including transformation coefficientsincluded in a subset of FIG. 20 and context offset indices c1 used inentropy encoding or decoding of transformation coefficient informationthat is entropy encoded or decoded, according to an embodiment of thepresent invention. As described above, the context offset index c1 isdetermined based on a length of a transformation coefficient having acontinuous value of 1 and is assumed as being obtained according to FIG.27B. Also, in FIG. 28, the order of processing of each transformationcoefficient is assumed to be from left to right.

Also, here, it is assumed that the context set index ctxset for entropyencoding and decoding of a first critical value flag Gtr1 flag and thesecond critical value flag Gtr2 flag is determined not based oninformation about a transformation coefficient included in a currentsubset but based on color component information of a transformationunit, a location of the current subset, and whether there is asignificant coefficient having a value greater than a first criticalvalue in a subset that is processed before the current subset.

Referring to FIG. 28, a last significant coefficient (2810) of 1 of asubset that is initially processed is not greater than 1, and thus thefirst critical value flag Gtr1 flag has a value of 0. Referring to FIG.27B, a context offset c1 with respect to the first critical value flagGtr1 flag of an initially processed significant coefficient is set to 1.

Next, the context offset c1 regarding the first critical value flag Gtr1flag of a next significant coefficient of the last significantcoefficient (2810) has a value of 2. This is because there is previouslythe significant coefficient (2810) having a value of 1. Similarly, thecontext offset c1 regarding the first critical value flag Gtr1 flag of anext significant coefficient 2830 has a value of 3. This is becausethere are previously two significant coefficients (2810 and 2820) havingtwo consecutive 1s.

As the first critical value flag Gtr1 flag of the significanttransformation coefficient (2830) has a value of 1, the context offsetc1 regarding the first critical value flag Gtr1 flag of all significantcoefficients after the significant coefficient (2380) has a value of 0.This is because, if the first critical value flag Gtr1 flag has a valueof 1 as a result of determining in operation 2618 of FIG. 26, thecontext offset c1 is set to 0 according to operation 2619, and thus thecontext offset c1 regarding all significant coefficients thereafter areset to 0.

FIG. 29 illustrates a table including transformation coefficientsincluded in a subset and context offset indices c1 used in entropyencoding or decoding of transformation coefficient information that isentropy encoded or decoded, according to an embodiment of the presentinvention. Comparing FIGS. 28 and 29, FIG. 29 is different from FIG. 28in that a last significant coefficient (2910) has a value of 2, and thusthe first critical value flag Gtr1 flag has a value of 1. As describedabove, the context offset c1 regarding the first critical value Gtr1flag of the significant coefficient 2910 that is initially processed isset to 1, and the context offset c1 regarding the first critical valueflag GTR1 of all significant coefficient after the significantcoefficient (2910) has a value of 0.

According to the method and apparatus for determining a context modeldescribed above, a context set index ctxset is obtained based on colorcomponent information of a transformation unit, a location of a currentsubset, and whether there is a significant coefficient having a valuegreater than a first critical value in a subset that is processed beforethe current subset, and a context offset c1 is obtained based on alength of a previous transformation coefficient having consecutive 1s.

The standards for determining a context index or a context offset arenot limited to the embodiments of the present invention and may also bemodified.

For example, as illustrated in Table 2, a context offset c1 may be setby grouping cases where probabilities are similar.

TABLE 2 Context offset (C1) 0 1 or more larger than 1 1 Initial—notrailing ones 2 1 or 2 trailing one 3 3 or more trailing one

When comparing Table 2 and FIG. 27B, Table 2 shows one continuoustransformation coefficient and two continuous transformationcoefficients that are grouped in a same group to be mapped in onecontext. Also, context set may be grouped based on the number ofsignificant coefficients equal to or greater than 1 included in aprevious subset so as to set a context set index ctxset as shown inTable 3.

TABLE 3 Context set index (ctxset) (with respect to a previous subset) 0Initial—no larger than one 1 1 or 2 larger than one (one or twosignificant coefficients) 2 3 or more larger than one (three or moresignificant coefficients)

In addition, a context set index ctxset may be set as shown in Tables 4through 6 below based on the type of a slice in which a currenttransformation unit is included, a location of a current subset, and thenumber of transformation coefficients having a value equal to or greaterthan a predetermined critical value in a previous subset.

TABLE 4 Context set (ctxset)/in the case of slice I 0 Subset0 0 largerT1 in previous subset 1 1-4 larger T1 in previous subset 2 >4 larger T1in previous subset 3 other 0-1 larger T1 in previous subset 4 subsets1-4 larger T1 in previous subset 5 >4 larger T1 in previous subset

Referring to Table 4, for example, when entropy encoding or decodingtransformation coefficients included in a subset of a transformationunit included in a slice I, one of 0 to 5 may be set as a value of acontext set index ctxset based on a location of a current subset,whether the number of a transformation coefficient having a value equalto or greater than a predetermined critical value T1 in a previoussubset is 0, 1 to 4, or more than 4.

TABLE 5 Context set (ctxset)/in the case of slice P 0 Subset0 0 largerT1 in previous subset 1 1-3 larger T1 in previous subset 2 >3 larger T1in previous subset 3 other 0 larger T1 in previous subset 4 subsets 1-3larger T1 in previous subset 5 >3 larger T1 in previous subset

Referring to Table 5, for example, when entropy encoding or decodingtransformation coefficients included in a subset of a transformationunit included in a slice P, one of 0 to 5 may be set as a value of acontext set index ctxset based on a location of a current subset,whether the number of a transformation coefficient having a value equalto or greater than a predetermined critical value T1 in a previoussubset is 0, 1 to 3, or more than 3.

TABLE 6 Context set (ctxset)/in the case of slice B 0 Subset 0 larger T1in previous subset 1 0 1-2 larger T1 in previous subset 2 >2 larger T1in previous subset 3 other 0 larger T1 in previous subset 4 subsets 1-2larger T1 in previous subset 5 >2 larger T1 in previous subset

Referring to Table 6, for example, when entropy encoding or decodingtransformation coefficients included in a subset of a transformationunit included in a slice B, one of 0 to 5 may be set as a value of acontext set index ctxset based on a location of a current subset,whether the number of a transformation coefficient having a value equalto or greater than a predetermined critical value T1 in a previoussubset is 0, 1 or 2, or more than 2.

The embodiments of the present invention can be written as computerprograms and can be implemented in general-use digital computers thatexecute the programs using a computer-readable recording medium.Examples of the computer-readable recording medium include magneticstorage media (e.g., ROM, floppy disks, hard disks, etc.) and opticalrecording media (e.g., CD-ROMs or DVDs).

While the present invention has been particularly shown and describedwith reference to exemplary embodiments thereof, it will be understoodby those of ordinary skill in the art that various changes in form anddetails may be made therein without departing from the spirit and scopeof the present invention as defined by the following claims.

1. A method of determining a context model for entropy decoding, themethod comprising: determining a current transform unit based on sizeinformation of a transform unit, wherein the size information isobtained from a bitstream; when the current transform unit is atransform unit of a chroma component, determining a context set of acurrent sub-block in the current transform unit independently of aposition of the current sub-block determining a context model forobtaining first threshold value information based on the context set ofthe current sub-block; and determining a context model for obtainingsecond threshold value information based on the context set of thecurrent sub-block, wherein the first threshold value informationindicates whether a current transformation coefficient in the currentsub-block is greater than a first threshold value, and the secondthreshold value information indicates whether a current transformationcoefficient in the current sub-block is greater than a second thresholdvalue.
 2. The method of claim 1, the method further comprising: when thecurrent transform unit is a transform unit of a luminance component,determining the context set of the current sub-block in the currenttransform unit based on the position of the current sub-block.
 3. Themethod of claim 1, wherein the first threshold value is 1, and thesecond threshold value is 2.